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USE: OF ON LINE 


DATA. BANK FOR INFORMATION RETRIEVAL 


INTRODUCTION 


The basis of all scientific work, in the tobacco or in any other 
field, is a complete revue of what has been done before. It is 
then very important to> have access to good tools for litterature 
survey. 


We all! know how this is. difficult ,, and speclaly in the tobacco 

\ 

field where informations are scattered in' a number of. magazines. 
What possibilities can use research people : 


- Or their company or institute has. a library. It must be of a 
very' large size to catch all information,, and this Is expensi¬ 
ve, in surface as in cost of magazines, 

- Or they are lucky to work near from a university which has 
a large library, 

- Or they have access to data bank terminals through which' they 
can be In contact with most of the Information' stored in li¬ 
braries in the entire world, and: this is the subject I would 1 
like to talk to= you about today. 


...A.. 
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Among the numerous tools that computers are giving to us 
nowadays, there is one of direct interest for scientists : 
on-line databanks for information retrieval* 


A databank is a computer system which gives access to informa¬ 
tion. Generally speaking, there are two types of databanks : 


1 - Documentation databanks - They contain litterature referen¬ 
ces about various subjects, from Chemistry (Chemical Abs¬ 
tracts, for example) to economy (Economical Abstracts) 1 or 
patents (INPI, PATSEARCH). 

Some databanks can be specialized ini a specific subject 
like paper OPIRA) or coffee (COOFEELINE). 

In theory, these databanks guarantee a complete survey of 
the information available in a given field. 

The user can be' confident that specialists have stored ini 
the databank all papers published in' a given list of maga¬ 
zines. 


2 - Facts databanks - This second type of-databanks; gives facts, 
for example thermodynamical values, meteorological infor¬ 
mations, commercial statistics or information on financial 
relationships between companies. 
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Those databanks answer an always increasing demand for informa¬ 
tion retrieval. They allow the replacement of large surface 
libraries storing collections of periodicals. 


By the end of 1932, it was possible to have access in' France, 
to 340 1 different databanks. By the middle of 19 83 , a new list 
of 45 databanks has been added. 


How to communicate with' a databank ? 


The process is simple when you look at it as a whole. When 1 you> 
get to details it is a little more complicated. 


§ 



The PRODUCER collects, screens, updates and analyse informa¬ 
tions linked with the subject. He runs a computer treatment 
and prepare a computer tape which he forwards at regular inter¬ 
vals to the ONLINE: SERVICE VENDOR. 


The ONLINE SERVICE VENDOR owns a powerful computer. He loads, 
and stores on this computer the databanks obtained from the 
PRODUCER. He assures the good run of the questionningi computer 
and of the software. 
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Some example of PRODUCERS and ONLINE SERVICE VENDORS : 


CHEMICAL ABSTRACTS Is the PRODUCER of a databank called 
CA SEARCH. 


This databank is loaded on computers belonging to diffe¬ 
rent ONLINE SERVICE VENDORS like : 

. DIALOG , of Lockheed Missiles and Space Company in 
Palo Alto ; ; 

. SDC , of System Development Corporation, USA ; 

. INFOLINE, of Pergamon. in the U.K. ; 

. ESA-IRS , of the European Space Agency in Italy ; 
QUESTEL , of Telesystem in France. 


Between the computer terminal owned by the USER and the ONLINE 
SERVICE VENDOR computer,, one needs a wire which put into con¬ 
tact two telephone stations, wherever they are ini the worlds 


However the cost of long distance calls rises strongly the 
price of the communication. TELECOM services in many countries 
have decided to set up specialised networks to carry over In¬ 
formation between computers and terminals. The use of these 
networks: is considerably cheaper that ’’normal" long, distance 
calls. 
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The best known NETWORKS are ; 


- in France 

- in Europe 

- in the USA 

- in South Africa 

- in Brazil 


i: TRANSPAC 
: EURONET 

: TYMNET or TELENET 
: SAPQNET 
: INTERDATA 


New networks are created at a high rate. This will make easier 
the access of users to databanks and contribute to their deve¬ 
lopment. 


Finally,, the USER,, toi get in touch with, the computer of the 
ONLINE SERVICE VENDOR needs a terminal. This can be either a 
keyboard-screen, a keyboard-printer or a keyboard-screen-printer 
combinationi. This terminal is equiped with an acoustic or elec¬ 
tric modem, to be connected with the telephone line. 


§ 


How does the system work ? 


- The USER has a terminal and a telephone line, 

- He asks the Network to supply him. with an identification' 
number (NUIi) . 

- He signs a contract with' the ONLINE SERVICE VENDOR who 
gives him a confidential access number to his computer. 

- To question a databank,, the USER operates as follows : 

.: Dial the 1 phone number of the NETWORK, then the terminal 
is. connected! and all the dialog is now made from the 
keyboard. 
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. Type then in: order : 

- the phone number of the ONLINE SERVICE VENDOR, 

- the identification number (NUI), 

- the confidential access number to the ONLINE SERVICE: 
VENDOR, 

- the code name of the desired databank. 

Everything is now ready for questionning the databank : 

Let us take an example : 

We would like to run a litterature survey on the analysis of 
carbon monoxide in> cigarette smoke. 

We have decided 5 to question CHEMICAL ABSTRACTS loaded on ESA-IRS 
computer. 

We: have dialed the Network TFANSPAC: Number,, the ESA-IRS phone 
number, one NUI and one confidential password!. The computer 
accepts the connection and gives the starting time. 

We ask to question CHEMICAL. ABSTRACTS Databank, and will use 
keywords to obtain one information. 

For each keyword: the computer will answer the number of 
appearance in the databank. For each of the following keywords 
the answer is :: 

- carbone 

- monoxide 

- smoke 

- cigarette 

- analyse 


. . ./.. . 


answer : 250362 appearance 
28709 appearance 
6955 appearance 
494 appearance 
320746: appearance 
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Then we combine the five keywords together looking for papers 
having all the keywords at the same time. Computer answers 
six -papers. 

We ask to look at the titles and reference of the six articles 
(see .Annex 1):. 

Interrogation is then, ended and the communication disconnected. 


Let us talk about cost 


The price of a terminal with' screen, printer and modem is in 
the range of 15 OOOi to 30 000 FF. This can be rented for around 
1 000 FF per month. 

\ The use of the NETWORK costs from 30 FF to. 40 FF an hour. 

The use of a. databank, invoiced by the ONLINE SERVICE. VENDOR 
costs between. 300 and 400 FF per hour. 

The incoming on the market of small terminals of the type 
MINITEIL, rented by the telephone services for around 70 FF 
g per month will considerably increase in the near future the 

number of users of databanks. 

International Networks 

The number of countries having access to each other through 
NETWORKS specifically for computer communication is increasing 
every month and will soon cover the largest part of the world. 

The annexed list (Annex 2); gives the today situation of 
Networks connecting: France with other countries. 

Among the members of CORESTA, 117 on 136 will belong to coun¬ 
tries interconnected with. France and Europe by the end of 1954. 
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LANGUAGES 

Each ONLINE SERVICE VENDOR has developped its own language to 
load the databanks on its computer. The USER needs to have a 
certain natural talent to< cope with the different languages. 


If you want to question, from a European country, most of 
the databanks available around the world, you need, besides 
English, to know at least 4 languages : 

- DIALOG ('Lockheed) 

- ORBIT (SDC) 

- REGON (ESA-IRS) 

- MISTRAL (QUESTEL) 


These languages have many commun features, but,.and this Is 
more cumbersome, a. number of small differences.. 


ONLINE SERVICE VENDORS propose training! sessions and handbooks 
to learn the languages.. However we must say that there is a 
need for some practise before 1 a language can be mastered, in 
order not to loose time during an 1 interrogation'. 


HOW TO SELECT THE DATA BANK 

Databanks are a relatively new phenomenon, but many databanks 
are already available. There is a business 

competition between different ONLINE SERVICE VENDORS. Each of 
them tries to offer the most complete and complementary, range 1 
of databanks so as to gain regular customers, who will appre¬ 
ciate to avoid the use' of different languages at one time. 
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The beginner user will start be questionning different ONLINE 
SERVICE VENDORS offering similar possibilities. Sooner or later 
he will find that some databanks axe richer in information' than 
others and rather than to question different databanks on 
one given subject, he will systematically call on the richest. 

As far as tobacco is concerned, the databank produced by 
CHEMICAL ABSTRACTS t available through several ONLINE SERVICE 
VENDORS, is , second our mind, the richest currently availa¬ 
ble, at least for technology and chemistry aspects. But 
Agronomy is not as well covered. 

To illustrate this question, I decided to run the following 
exercise : I took litterature references in CORESTA bulletins 
and in Tobacco Abstracts ;; and we tried to retrieve them in 
different databanks. 

The title of the first page was : 

"Chemical studies; on tobacco smoke" 

LI : Studies on non volatile Nitrosamine in tobacco 

Authors : S.S. HECHT .... and T.C. TSO 

Reference : Beitrage sur Tabakforschung _9 1-7. (1977) 

Different databanks were questionnedusing the following 
keywords : 

- Tobacco' 

- Smoke 

- Nitrosamine 

- Non volatile 
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We only found this paper in CHEMICAL ABSTRACTS databank, 
using the, ONLINE SERVICE VENDOR "ESA-IRS". 

In this bank, 4 references corresponded to the combination of 
the 4 keywords,: and among them, the paper which we were loo¬ 
king for. 

We operated in the same way for a second paper . 

a 

Title : Biochemical Changes in leaf pigments and chemical 
constituents during; Flue-curing; of tobacco. 

Authors : A. GOPALAM and N.C. GOPALACHARI 

Reference : Tobacco Research S 117-124 (1979) 

We combined 4 keywords : 


- Biochemical 

- Changes 

- Leaf 

- Pigment 

Again, we only found this, paper in CHEMICAL ABSTRACTS data¬ 
bank where 4 papers corresponded to the 4 keywords. 

But we were unable to find trace of several papers on tobacco, 
published: in a Canadian magazine "THE LIGHTER" or in Japanese 
magazines. 
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CONCLUSION 

The first objective of this presentation was to describe for 
those of the audience who were not already familiar with it, 
a new information retrieval technique through the use of 
online databanks. 

My real objective was to lead you to ask you the following 
question^ : 


- Do you think that the creations of a databank, which could 
be called CORESTA databank could be useful to you ? 

We all use the excellent documentation tool which is the 
\ CORESTA Bulletin, and I do not suggest that we suppress 

this tool. On the contrary, I propose a complementary tool 
which will contain everything already published in CORESTA 
Bulletins since its beginning, and completed, eventually by 
other informations, incompletely covered now. 

g A CORESTA databank could, also supply facts, for example the 
survey of BLUE MOLD: progress. 

I am of course aware that my proposal represents a very heavy 
task and that we may not have the ressources. 

What I suggest today is, as a first step, to set up a task 
force, in the Technology group which will run a feasability 
study on the development of a CORESTA databank. 

We will obviously consult other associations, comparable to 
CORESTA, which have already decided to go in this direction. 


For example : INTERNATIONAL COFFEE ORGANIZATION 

in LONDON 


RESEARCH ORGANIZATION 1 FOR; THE: PAPER AND 
BOARD' (PIRA} in. the U.K.. 
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